Morphological Segmentation of Nouns Using an Inductive Logic Programming System

نویسندگان

  • ARBANA KADRIU
  • LULE AHMEDI
  • LEJLA ABAZI
چکیده

One of the most explored fields of NLP is morphology. It is important because language is productive: in any given text we will encounter text words an word forms that we haven’t seen before and that are not in a precompiled dictionary. The core task of computational morphology is to take a word as input and produce a morphonological analysis for it. There are a lot of approaches in formalizing and automatically finding rules that implement the morphology of a language. One approach is through decision lists. Decision lists can be employed as a representation language for a wide range of tasks. Clog is a system that is based on this logic and that has been primarily developed with natural language applications in mind. So far, Clog has been successfully employed for morphology learning tasks. The research presented in this paper is about machine learning of the rules of nouns morphology in Albanian. Starting point of the research presented here are 7 lists, each consisting of 6000 pairs of feminine nouns in Albanian. The pairs in the lists hold the noun’s base form and inflected form. As base word is considered the definite singular of nominative. Key-Words: morphological segmentation, nouns, inductive logic programming, Albanian language

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The segmentation problem in morphology learning

Recently there has been a large literature on various approaches to learning morphology, and the success and cognitive plausibility of different approaches (Rumelhart and McClelland (1986), MacWhinney and Leinbach (1991) arguing for connectionist models, Pinker and Prince (1988), Lachter and Bever (1988), Marcus et al. (1992) arguing against connectionist models, Ling and Marinov (1993), Ling (...

متن کامل

Robust Potato Color Image Segmentation using Adaptive Fuzzy Inference System

Potato image segmentation is an important part of image-based potato defect detection. This paper presents a robust potato color image segmentation through a combination of a fuzzy rule based system, an image thresholding based on Genetic Algorithm (GA) optimization and morphological operators. The proposed potato color image segmentation is robust against variation of background, distance and ...

متن کامل

A Hybrid Approach to Word Segmentation

This article presents a combination of unsupervised and su pervised learning techniques for generation of word segmentation rules from a list of words First a bias for word segmentation is introduced and a simple genetic algorithm is used for the search of segmentation that corresponds to the best bias value In the second phase the seg mentation obtained from the genetic algorithm is used as an...

متن کامل

Learning Morphological Rules for Amharic Verbs Using Inductive Logic Programming

This paper presents a supervised machine learning approach to morphological analysis of Amharic verbs. We use Inductive Logic Programming (ILP), implemented in CLOG. CLOG learns rules as a first order predicate decision list. Amharic, an under-resourced African language, has very complex inflectional and derivational verb morphology, with four and five possible prefixes and suffixes respectivel...

متن کامل

An Automated MR Image Segmentation System Using Multi-layer Perceptron Neural Network

Background: Brain tissue segmentation for delineation of 3D anatomical structures from magnetic resonance (MR) images can be used for neuro-degenerative disorders, characterizing morphological differences between subjects based on volumetric analysis of gray matter (GM), white matter (WM) and cerebrospinal fluid (CSF), but only if the obtained segmentation results are correct. Due to image arti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010